An Invariants-based Method for Efficient Identification of Hybrid Species From Large-scale Genomic Data

Authors

  • Laura S. Kubatko
  • Julia Chifman
Abstract

Coalescent-based species tree inference has become widely used in the analysis of genome-scale multilocus and SNP datasets when the goal is inference of a species-level phylogeny. However, numerous evolutionary processes are known to violate the assumptions of a coalescence-only model and complicate inference of the species tree. One such process is hybrid speciation, in which a species shares its ancestry with two distinct species. Although many methods have been proposed to detect hybrid speciation, only a few have considered both hybridization and coalescence in a unified framework, and these are generally limited to the setting in which putative hybrid species must be identified in advance. Here we propose a method that can examine genome-scale data for a large number of taxa and detect those taxa that may have arisen via hybridization, as well as their potential “parental” taxa. The method is based on a model that considers both coalescence and hybridization together, and uses phylogenetic invariants to construct a test whose computational time scales well with both the number of taxa and the amount of sequence data. We test the method using simulated data for up to 20 taxa and 100,000 bp, and find that the method accurately identifies both recent and ancient hybrid species in less than 30 seconds. We apply the method to two empirical datasets, one composed of Sistrurus rattlesnakes for which hybrid speciation is not supported by previous work, and one consisting of several species of Heliconius butterflies for which some evidence of hybrid speciation has been previously found.
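
The abstract does not state which invariants the test uses, so the following is only a minimal sketch of the general idea, not the authors' implementation. A phylogenetic invariant is a function of site pattern probabilities that is expected to equal zero under a given model, so an invariant-based test starts from site pattern counts. The function names, the four-taxon ordering (outgroup, first parent, putative hybrid, second parent), and the choice of which two patterns to contrast are all assumptions made for illustration.

```python
from collections import Counter


def site_pattern_counts(outgroup, p1, hybrid, p2):
    """Count four-taxon site patterns over aligned sequences.

    Hypothetical helper: the taxon ordering is an assumption for this sketch.
    Each alignment column is relabeled by order of first appearance, so the
    column A, A, G, G becomes "iijj" and A, G, G, A becomes "ijji".
    """
    seqs = (outgroup, p1, hybrid, p2)
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")

    counts = Counter()
    for column in zip(*seqs):
        # Skip columns containing gaps or ambiguity codes.
        if any(base not in "ACGT" for base in column):
            continue
        labels = {}
        pattern = "".join(
            labels.setdefault(base, "ijkl"[len(labels)]) for base in column
        )
        counts[pattern] += 1
    return counts


def pattern_contrast(counts, first, second):
    """Difference between the observed frequencies of two site patterns.

    Which contrasts are expected to vanish depends on the model in the paper;
    the pair passed in here is the caller's (illustrative) choice.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no usable alignment columns")
    return (counts[first] - counts[second]) / total


if __name__ == "__main__":
    counts = site_pattern_counts("AAAAA", "AGGA-", "GAGAA", "GGAAA")
    print(dict(counts))
    print(pattern_contrast(counts, "iijj", "ijij"))
```

Counting patterns takes a single pass over the alignment for each set of four taxa, which is consistent with the abstract's claim that the test scales well with the amount of sequence data. Turning a contrast into a formal test would also need its sampling variance, which this sketch does not attempt.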

Publication date: 2015